Using Data Compressors to Construct Rank Tests

نویسندگان

  • Daniil Ryabko
  • Jürgen Schmidhuber
چکیده

New nonparametric rank tests for homogeneity and component independence are proposed, which are based on data compressors. For homogeneity testing the idea is to compress the binary string obtained by ordering the two joint samples and writing 0 if the element is from the first sample and 1 if it is from the second sample and breaking ties by randomization (extension to the case of multiple samples is straightforward). H0 should be rejected if the string is compressed (to a certain degree) and accepted otherwise. We show that such a test obtained from an ideal data compressor is valid against all alternatives. Component independence is reduced to homogeneity testing by constructing two samples, one of which is the first half of the original and the other is the second half with one of the components randomly permuted.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using data compressors to construct order tests for homogeneity and component independence

Nonparametric order tests for homogeneity and component independence are proposed, which are based on data compressors. For homogeneity testing the idea is to compress the word obtained by ordering the combined samples and writing the number of the sample in place of each element. H0 should be rejected if the string is compressed to a certain degree and accepted otherwise. We show that such a t...

متن کامل

Reducing Hardware Complexity of Wallace Multiplier Using High Order Compressors Based on CNTFET

   Multiplier is one of the important components in many systems such as digital filters, digital processors and data encryption. Improving the speed and area of multipliers have impact on the performance of larger arithmetic circuits that are part of them. Wallace algorithm is one of the most famous architectures that uses a tree of half adders and full adders to increase the speed and red...

متن کامل

Compression of Unicode Files

The increasing importance of Unicode for text files, for example with Java and in some modern operating systems, implies a possible doubling of data storage space and data transmission time, with a corresponding need for data compression. However it is not clear that data compressors designed for 8-bit byte data are well matched to 16-bit Unicode data. This paper investigates the compression of...

متن کامل

Modeling and Optimization of Industrial Multi-Stage Compressed Air System Using Actual Variable Effectiveness in Hot Regions

In this article, modeling and optimization of power consumption of two–stage compressed air system has been investigated. To do so, the two – stage compressed air cycle with intercooler of FAJR Petroleum Company was considered. This cycle includes two centrifugal compressors, a shell, and a tube intercooler. For modeling of power consumption, actual compressors isentropic efficiencies and inter...

متن کامل

Modeling and Optimization of Industrial Multi-Stage Compressed Air System Using Actual Variable Effectiveness in Hot Regions

In this article, modeling and optimization of power consumption of two–stage compressed air system has been investigated. To do so, the two – stage compressed air cycle with intercooler of FAJR Petroleum Company was considered. This cycle includes two centrifugal compressors, a shell, and a tube intercooler. For modeling of power consumption, actual compressors isentropic efficiencies and inter...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/0709.0670  شماره 

صفحات  -

تاریخ انتشار 2007